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(54) Speech coding device 



(57) A speech coding device in which an excitation 
signal of speech signals is expressed as a sum of a plu- 
rality of pulse strings, and positions of the pulse strings 
are selected from predetermined pulse position candi- 
dates to determine the excitation signal so that distor- 
tion between an input speech signal and a reproduced 
speech signal obtained by exciting a synthetic filter 
using the excitation signal may be minimized, resulting 
in obtaining reproduced speech signals with high quality 
in a small operational amount. In a pulse searcher (58), 
a. pulse generating section (11 to 15) outputs a plurality 
of pulse strings, and a pulse searching section (21 to 
24, 31 to 34, 41, 42) sequentially searches the pulse 
strings to determine the positions of the plurality of 
pulse strings constituting the excitation signal. One 
pulse searching section (21 to 24) searches using a 
Viterbi algorithm. Another pulse searching section (31 
to 34) preliminarily searches in a tree shape of pulse 
position candidates. Another pulse searching section 
(41 , 42) searches every pulse position candidate group. 
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Description 

BACKGROUND OF THE INVENTION 

The present invention relates to a speech coding device capable of determining an excitation signal so as to mini- 
mize distortion between a reproduction speech signal and an input speech signal, and more particularly to a speech 
coding device for coding speech signals with high speech quality by a low operational amount 

Description of the Related Art 

As a conventional coding system for speech signals at low bit rates of equal to or less than 4.8 kbits/sec. for exam- 
ple, a CELP (code-excited linear prediction) coding system has been known, as disclosed in "Code-Excited Linear Pre- 
diction: High-Guality Speech At Very Low Bit Rates", by M.R. Schroeder and B. S. Atal, Proc. ICASSP, pp. 937-940, 
1985 (the first Document), and "Improved Speech Quality And Efficient Vector Quantization In CELP", by W.B. Kleijin, 
D.J. Krasinski and R.H. Ketchum, Proc. ICASSP, pp. 155-158, 1988 (the second Document). 

In this CELP coding system, when coding on a transmitter side, first, spectral parameters representing spectral 
characteristics of speech signal are extracted from the speech signal using a LPC (linear predictive ceding) analysis, 
for example, every frame of 20 ms composed of the speech signals. Further, the frame is divided into, for example, 5 
ms of subframes, and parameters (a delay parameter and a gain parameter corresponding to a pitch cycle) are 
extracted based on an excitation signal every frame using an adaptive codebook. 

In the CELP coding system, the speech signals of the above described subframes are predicted from the adaptive 
codebook, and the optimum random code vector is selected from a random codebook (a vector quantized codebook) 
consisting of predetermined kinds of noise signals to calculate the optimum gain, resulting in quantizing the excitation 
signal. 

On this occasion, the optimum random code vector is selected so that an error power between the input speech 
signal and the reproduced speech signal synthesized by considering the selected random code vector as the excitation 
signal may be minimized. The gain and the index representing the kind of the selected random code vector, and the 
foregoing spectral parameter and the parameter of the adaptive codebook are combined in a multiplexer to output a 
combination of the codes from an output terminal for transmitting. 

A decoding procedure on a receiver side is conducted in a conventional manner and the detailed description 
thereof can be omitted for brevity. 

Further, in order to reduce a memory amount and an operational amount in the CELP coding system, a conven- 
tional fast coding method has been proposed, as disclosed in "Fast CELP Coding Based On Algebraic Codes", by J-R 
Adoul, R Mabilleau, M. Delpratand S. Morissette, Proc. ICASSP, pp. 1957-1960, 1987 (the third document). 

Next, a conventional excitation signal search method using pulse strings produced in an algebraic manner as an 
excitation signal in a CELP coding system will be described. 

In this search method, an excitation signal is expressed in the form of a sum of pulse strings selected from a plu- 
rality of channels. The pulse strings are selected from pulse candidate positions predetermined every each channel. 
The amplitude of each pulse is only of a polarity. For example, when a subframe length sampled at 8 kHz is 5 ms (a 
sample number N = 8k x 5m = 40), an excitation signal per subframe is expressed, for example, by a sum of P = 5 
number of single pulses selected from P = 5 number of channels. In this instance, each of the P = 5 channels has M (= 
N/P = 40/5) = 8 number of predetermined pulse candidate positions. 

The optimum excitation signal can be searched so that the distortion between the input speech signal and the 
reproduced speech signal obtained by exciting a synthetic filter using the excitation signal may be minimized. Now, 
when using the excitation signal as the pulse string, the minimization of the distortion between the input speech signal 
and the reproduced speech signal becomes equivalent to the maximization of the following formula (1). 

p-1 

{^a(i),d[m(0]} 2 

E <*>=FM (D 

I>[m(i),mG)] + 2 ■ £ S a (').a«. <«m(i),m(j)] 

(=0 t=0j=i+1 

In this formula, a symbol a(i), [i = 0 P-1] represents " 1" or "-1", a symbol <|>(i, j). [i, j = 0 N-1] represents an auto- 
correlation function responsive to an impulse in a synthetic filter, and a symbol d(i), [i = 0, N-1] represents a target 
signal obtained from an input speech signal and an impulse response signal. A symbol k can be resulted from "m(i)*' [i 
= 0, .... P-1] representing an excitation signal and can be transmitted at "(1 + log M) x P " bits. 

The search according to an evaluation function of formula (1) can be carried out sequentially one by one using P- 
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times loops. 

In the above conventional speech coding system, the excitation signal is expressed by the pulse string of only the 
polarity in the search method of the excitation signal. The search of this pulse position is sequentially implemented one 
by one against the entire candidates, and the operational amount in the searching turns out very much. 

On the other hand, when performing a preliminary selection of the pulse positions to be searched in order to reduce 
the operational amount in the searching, the quantizing efficiency deteriorates and the reproduced speech signal qual- 
ity is degraded. 

SUMMARY OF THE INVENTION 

10 

It is therefore an object of the present invention to provide a speech coding device in view of the aforementioned 
problems of the prior art, which is capable of searching the optimum pulse string representing an excitation signal in a 
low operational amount to obtain a reproduction speech with high quality. 

In accordance with one aspect of the present invention, there is provided a speech coding device, in which an exci- 

15 tation signal of speech signals is expressed as a sum of a plurality of pulse strings, and positions of the pulse strings 
are selected from predetermined pulse position candidates to determine the excitation signal so that distortion between 
an input speech signal and a reproduced speech signal obtained by exciting a synthetic fflter using the excitation signal 
may be minimized, comprising means for generating a plurality of pulse strings; and means for searching the pulse 
strings sequentially every pulse string using a Viterbi algorithm to determine the positions of the plurality of pulse strings 

20 constituting the excitation signal. 

In accordance with another aspect of the present invention, there is provided a speech coding device, in which an 
excitation signal of speech signals is expressed as a sum of a plurality of pulse strings, and positions of the pulse strings 
are selected from predetermined pulse position candidates to determine the excitation signal so that distortion between 
an input speech signal and a reproduced speech signal obtained by exciting a synthetic fBter using the excitation signal 

25 may be minimized, comprising means for generating a plurality of pulse strings, pulse position candidates of the pulse 
strings being expressed in a tree shape; and means for searching the pulse strings sequentially every pulse string by a 
preliminary searching to determine the positions of the plurality of pulse strings constituting the excitation signal. 

In accordance with a further aspect of the present invention, there is provided a speech coding device, in which an 
excitation signal of speech signals is expressed as a sum of a plurality of pulse strings, and positions of the pulse strings 

30 are selected from predetermined pulse position candidates to determine the excitation signal so that distortion between 
an input speech signal and a reproduced speech signal obtained by exciting a synthetic filter using the excitation signal 
may be minimized, comprising means for generating a plurality of pulse strings, pulse position candidates of the pulse 
strings being divided into groups; and means for searching the pulse strings sequentially every pulse position candidate 
group to determine the positions of the plurality of pulse strings constituting the excitation signal. 

35 

BRIEF DESCRIPTION OF THE DRAWINGS 

The objects, features and advantages of the present invention will become more apparent from the consideration 
of the following detailed description, taken in conjunction with the accompanying drawings, in which: 

40 

Fig. 1 is a block diagram of a speech coding device according to one embodiment of the present invention; 

Fig. 2 is a block diagram of a first embodiment of a pulse searcher shown in Fig. 1 ; 

Fig. 3 is a block diagram of a second embodiment of a pulse searcher shown in Fig. 1 ; and 

Fig. 4 is a block diagram of a third embodiment of a pulse searcher shown in Fig. 1 . 

45 s 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Referring now to the drawings, in Fig. 1 , there is shown a speech coding device according to one embodiment of 
the present invention. 

so In Fig. 1 , the speech coding device comprises a frame divider 51 , a subframe divider 52, a spectral parameter cal- 

culator 53, a spectral parameter quantizer 54, a filter factor calculator 55 of a (human auditory) perceptual weighting 
synthetic fitter, a (human auditory) perceptual weighter 56, an adaptive codebook searcher 57, a pulse searcher 58, a 
gain codebook searcher 59, and a multiplexer (MUX) 50. 

More specifically; first, speech signals input from an input terminal are divided, for example, every frame of 20 ms 

55 in the frame divider 51 and are further divided, for example, every subframe of 5 ms shorter than 20 ms of the frame in 
the subframe divider 52. 

The spectral parameter calculator 53 cuts out speech using a frame of, for example, 10 ms longer than a subframe 
length (in this case, 5 ms in the case of sampling at 8 kHz with a sampling number N = 40) against the speech signals 
of at least one subframe and it is assumed that the spectral parameter calculator 53 calculates spectral parameters by 
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a predetermined dimensional number L of, for example, ten degrees (L = 10). 

For the calculation of the spectral parameters, a well-known LPC analysis can be used. 

Further, the spectral parameter calculator 53 converts linear predictive factors a(i),[i = 1 L] into LSP (line spec- 
trum pair) parameters adaptive to a quantization and an interpolation. For the conversion from the linear predictive fac- 
5 tors into the LSP parameters, a paper "Speech Data Compression By LSP Speech Analysis-Synthesis Technique", by 
N. Sugamura and F. Itakura, IECE J64-A, pp. 599-606, 1981 (the fourth document) can be used. The linear predictive 
factors are output to the filter factor calculator 55, and the LSP parameters are to the spectral parameter quantizer 54. 

The spectral parameter quantizer 54 quantizes the LSP parameters effectively For this quantization of the LSP 
parameters, well-known quantizing methods can be used. For example, Japanese Patent Application Laid-Open Publi- 
jc cation No. 4-1 71 500 (the fifth document) or the like can be referred, and the description thereof can be omitted for brev- 
ity. The spectral parameter quantizer 54 further converts the quantized LSP parameters into the linear predictive factors 

a(i),[i = 1 LJ to output the obtained linear predictive factors to the filter factor calculator 55 and also outputs codes 

representing code vectors of the quantized LSP parameters to the multiplexer 50. 

The filter factor calculator 55 inputs the linear predictive factors before the quantization from the spectral parameter 
is calculator 53 and the quantized linear predictive factors from the spectral parameter quantizer 54 and calculates factors 
of a perceptual weighting filter expressed by formula (2) to output the calculated factors to the perceptual weighter 56. 
The filter factor calculator 55 further outputs factors of a perceptual weighting synthetic filter consisting of a linear pre- 
dictive synthetic filter and a perceptual weighting filter to the adaptive codebook searcher 57, and the pulse searcher 
58 and the gain codebook searcher 59. 

20 

L 

Hw(z) = S=3 (2) 

25 l-£aO).R1 l .z- 1 

In this formula, R1 and R2 represent weighting factors for controlling a perceptual weighting amount, and, for example, 
R1 = 0.9 and R2 = 1 .0 are applied. 

30 The perceptual weighter 56 reproduces the weighting filter from the factors of the perceptual weighting filter sup- 
plied from the filter factor calculator 55 and weights the input signal to output perceptual weighted input signal X(n) to 
the adaptive codebook searcher 57, the pulse searcher 58 and the gain codebook searcher 59. 

The adaptive codebook searcher 57 cuts out a segment of a delay d (a pitch cycle) from a past excitation signal 
and repeatedly connects the cutout segments until the connected segments have the subframe length N to produce the 

35 adaptive code vector Ad(n) corresponding to the delay d, and selects the pitch cycle d and the adaptive code vector 
Ad(n) so that an error power between a perceptual weighting input signal and a perceptual weighting synthetic signal 
obtained using the produced adaptive code vector Ad(n) may be minimized. 

Further, the adaptive codebook searcher 57 outputs a code representing the selected pitch cycle d to the multi- 
plexer 50, outputs the selected adaptive code vector Ad(n) to the gain codebook searcher 59, and outputs the percep- 

40 tual- weighted and selected adaptive code vector SAd(n) to the pulse searcher 58. 

The pulse searcher 58 calculates the optimum pulse string Cj(n) using the factor of the perceptual weighting syn- 
thetic filter, the perceptual weighted input signal X(n), and the perceptual- weighted and selected adaptive code vector 
SAd(n) and outputs the calculated optimum pulse string Cj(n) to the gain codebook searcher 59 and the multiplexer 50. 
According to the present invention, the pulse searcher 58 includes a plurality of embodiments and their detailed 

45 description will be described later. 

The gain codebook searcher 59 inputs the selected adaptive code vector Ad(n) from the adaptive codebook 
searcher 57, the optimum pulse string Cj(n) from the pulse searcher 58, the perceptual weighted input signal X(n) from 
the perceptual weighter 56 and the factors of the perceptual weighting synthetic filter from the filter factor calculator 55, 
and produces the perceptual weighting synthetic filter. 

so The gain codebook searcher 59 then calculates an excitation signal Ek(n) as a linear sum of the adaptive code vec- 
tor Ad(n) and the optimum pulse string Cj(n), as expressed in formula (3), and selects a gain code vector so that an error 
power between the perceptual weighted input signal and the perceptual weighted synthetic signal obtained by driving 
the perceptual weighting synthetic filter using the calculated excitation signal Ek(n) may be minimized. The gain code- 
book searcher 59 outputs the selected gain code vector to the multiplexer 50. 

55 

Ek(n)=Gk(1) • Ad(n)-Gk(2) • Cj(n) (3) 

In formula (3), Gk(1) and Gk(2) represent k-th two-dimensional gain code vectors. 

The multiplexer 50 inputs the codes representing code vectors of the quantized LSP parameters from the spectral 
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parameter quantizer 54, the code representing the selected pitch cycle d from the adaptive codebook searcher 57, the 
code representing the pulse string from the pulse searcher 58 and the code representing the gain code vector from the 
gain codebook searcher 59, and combines the input codes to output the combined codes to an output terminal. 

Figs. 2 to 4 show the first to third embodiments of the pulse searcher 58 of the speech coding device shown in Fig. 
5 1 corresponding to the speech coding device according to the first to third embodiments of the present invention, which 
are characterized by the first to third embodiments of the pulse searcher 58. 

The first embodiment of the pulse searcher 58 of the speech coding device shown in Fig. 1 will be described with 
reference to Fig. 2. 

In Fig. 2, the pulse searcher 58 includes a target signal generating circuit 10, first, second, third, fourth and fifth 
io pulse generating circuits 11 to 15, a pulse string coding circuit 20, and first, second, third and fourth Viterbi searching 
circuits 21 to 14. 

The pulse searcher 58 produces an excitation signal which is expressed as a sum of pulse strings selected from a 

plurality of channels. The pulse strings are selected from pulse position candidates predetermined every each channel. 

The amplitude of each pulse is only of a polarity. For example, in the case of a subframe length of 5 ms and sampling 
75 at 8 kHz (a sampling number N = 40), it is assumed that an excitation signal per subframe is expressed as a sum of, for 

example, P (= 5) number of single pulses selected from P (= 5) number of channels. In this instance, each of the P (= 

5) number of channels has predetermined M (= IM/P = 40/5 = 8) number of pulse position candidates. 

In Fig. 2, the target signal generating circuit 10 inputs the factors of the perceptual weighting synthetic filter and 

constitutes the perceptual weighting synthetic filter. Further, the target signal generating circuit 1 0 inputs the perceptual 
20 weighted input signal X(n) from the perceptual weighter 56 and the perceptual- weighted and selected adaptive code 

vector SAd(n) from the adaptive codebook searcher 57 and calculates an error signal z(n) according to formula (4) 

wherein a symbol G is expressed by formula (5). 



25 



30 



z(n) = X<n) - G • SAd(n) (4) 

N-1 

£X(n)SAd(n) 

(5) 

£SAd(n) 2 

i=0 



Further, the target signal generating circuit 10 filters the error signal z(n) backwards using the perceptual weighting 
synthetic filter to prepare a target signal d(n), produces an auto-correlation functions <j>(i, j) responsive to an impulse in 
35 the perceptual weighting synthetic filter, and outputs the target signal d(n) and the auto-correlation functions 4>(i, j) to the 
first, second, third and fourth Viterbi searching circuits 21, 22, 23 and 24. 

The first pulse generating circuit 1 1 places single pulses against predetermined 8 pulse position candidates (e.g. , 
N = 0, 5. 10, 15, 20, 25, 30, 35) and outputs these pulses to the first Viterbi searching circuit 21 . 

The second pulse generating circuit 12 places single pulses against predetermined 8 pulse position candidates 
40 (e.g. , N = 1 , 6, 1 1 , 1 6, 21 , 26, 31 , 36) and similar to the first pulse generating circuit 1 1 , outputs these pulses to the first 
Viterbi searching circuit 21 . 

The third pulse generating circuit 13 places single pulses against predetermined 8 pulse position candidates (e.g. 
, N = 2, 7, 12, 17, 22, 27, 32, 37) and outputs these pulses to the second Viterbi searching circuit 22. 

The fourth pulse generating circuit 1 4 places single pulses against predetermined 8 pulse position candidates (e.g. , 
45 N = 3, 8, 13, 18, 23, 28, 33, 38) and outputs these pulses to the third Viterbi searching circuit 23. 

Similarly, the fifth pulse generating circuit 15 places single pulses against predetermined 8 pulse position candi- 
dates (e.g., N = 4, 9, 14, 19, 24, 29, 34, 39) and outputs these pulses to the fourth Viterbi searching circuit 24. 

The pulse position candidates in the first to fifth pulse generating circuits 11 to 1 5 are one example and, of course, 
another positioning can be possible in the pulse position candidates. 
so The searching of the pulse strings in the first to fourth viterbi searching circuits 21 to 24 is carried out by selecting 

the optimum combination of the signals supplied from the two pulse generating circuits on the basis of a Viterbi algo- 
rithm. 

In the first Viterbi searching circuit 21 , when the 8 pulse signals (the pulse position m(1) = 1,6, 11 , 16, 21, 26, 31 , 
36) output from the second pulse generating circuit 12 are placed, the optimum combinations with the 8 pulse signals 
55 (the pulse position m(0) = 0, 5, 10, 15, 20, 25. 30,35) output from the first pulse generating circuit 1 1 are selected based 
on the Viterbi algorithm. 

That is, the first Viterbi searching circuit 21 adds the 8 pulse signals output from the first pulse generating circuit 1 1 
to each of the 8 pulse signals output from the second pulse generating circuit 1 2, and selects one pulse signal from the 
obtained 8 pulse signals so that an evaluation value P (in this case, P = 2) in formula (1 ) may be maximum. As a result, 
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the 8 selected pulse signals including the pulse position candidates of the second pulse generating circuit 12 are 
obtained as the candidates and these candidates are output to the second Vrtertoi searching arourt 22. 

In the second Viterbi searching circuit 22. when the 8 pulse signals (the pulse position m(2) = 2 7, 12, 7, 22, 27 
32 37)7uS frorr , he third pulse generating circuit 1 3 are placed, the optimum combinations wrth the 8 pulse s,gnals 
output ffi^ W«bi searching circuit 21 are selected (in this case. P = 3) in the same manner as described 
a^e, a^d the s2cted pulse signals induding the pulse position candidates of the third pulse generating arcuri 13. 
nhtained as the candidates are output to the third Viterbi searching circuit 23. 

nt ^ thSd V^rbi searching rircuit 23. a searching is executed (in this case P - 4) .n * s 
described above, and the selected pulse signals including the pulse position candidates (the puis •P^'^£J£ 
13 1 8 23 28 33 38) of the fourth pulse generating circuit 14 are obtained as the candidates, and these cand.dates 

ing f^SS££**~ Oh. pulse position m(4) = 4. 9. 14. 19. 24. 2* 34. 39^2^^ £ 
curt 1 5 are obtained as the candidates, and one pulse signal is finally selected from the obta.ned signals so that the 
^^^t^ case, P = 5) in formula (1) may be maximum. The selected pulse signal is output to the pulse 

SW 7nSmbSfme 0 nt, any connection between the pulse generating circuits 11 to 15 and the Viterbi searching^ 
cuits 21 to 2 TSn bTposs Me. For example, besides the above described connection, priority of each pulse generating 
circuit I dettrnSby the evaluation vie P (in this case. P = 1) in formula (1). and the pulse generating courts 1 1 
to 1 5 mav be connected to the Viterbi searching circuits 21 to 24 in the pnorrty order. 

X pulS sfrS coding circuit 20, codes are produced from the P (= 5) number of pulse positions const.tut.ng the 
pjT^StjTZ^. fourth Viterbi searching circuit 24. The produced codes are output to the multiplexer 50 and 
the Dulse sianal is supplied to the gain codebook searcher 59. ^^^«k^ 
Tne second embSiment of the pulse searcher 58 of the speech coding device shown .n Rg. 1 w..l be descnbed 

" ^ i? Fig Tt he pulse searcher 58 includes a target signa. generating circurt 10. first, seconded fourth and frf* 
pjTg^m dieuit. 1 1 to 1 5. a pulse string coding circurt 20. and first, second, third and fourth prehm.nary search- 

'Sff e^ment. as shown in Fig. 3. the second embodiment of the pulse searcher 58 ^^ construe- 
3 o tion as the first embodiment shown in Fig. 2. except that the first to fourth prel.rn.nary searching arcurtsSI to_ 34 are 
redTnsteadll L first to fourth Viterbi searching circuits 21 to 24. Thus, the description of the same parts as those of 
the f irst embodiment can be omitted for brevity. 

SrSal generating circuit 1 0 outputs the target signal d(n) and the auto-correlat,on funct.on * (.. j) to the 
first second third and fourth preliminary searching circuits 31. 32. 33 and 34. 
35 SS second third, fourth and fifth pulse generating circuits 1 1 to 1 5 output the pulses to the f .rsU.rst, second^ 
M^Zn^LysevcKnc* circurts 31 to 34. respectively, in thesame manner asthef.rst embod.ment shown 

R in fhis embodiment a search of pulse strings is carried out by placing the pulses strings in a tree shape obtainal 
byincrea^ 

4 o The first preliminary searching circuit 31 preliminarily selects Q (= 8) number of puse >^^«» M 

64} number of Dulse signals of a combination of M (= 8) number of pulse signals (the pulse position m(0) - 0. 5. 10.15. 
To 25 3^ 35) ou^ fromL first pulse generating circurt 1 1 and of M (= 8) number of pulse signals (the pulse position 
m(1) - f el T^1 26. 31 . 36) output from the second pulse generating circuit 1 2 so that the evaluation value P (,n 
this case P = 2) in formula (1) mayTe maximum, and outputs the selected pulse signals to the second prehm.nary 

45 ^tes^ preliminary searching circuit 32 preliminarily selects Q (= 8) number of pulse signal, .from the Q x M 
(- 8 = 64) number of pulse signals of a combination of M (= 8) number of pulse signals (the pu se posrt.on m(2 - 
2 7 12 1 7 22 2 7i 37) output from the third pulse generating circurt 1 3 and of Q (= 8) number of pulse signals pre- 
MrJnarily select ■ » f irst preliminary searching circurt 31 so that the evaluation value P (in th. case, P = 3) ,n for- 
Zl^f Tmav be maximum and outputs the selected pulse signals to the third preliminary search.ng circurt 33. 

. ^ thTThSp^minSy searing circurt 33 a preliminary searching is implemented in the same manner as 
descled^ove to select L Q (= 8) number of pulse signals from the Q x M (= 64) number of pulse signal including 
fh^SSs Tthe putee position m 3) = 3. 8. 13. 18. 23, 28, 33, 38) and the signals preliminarily selected in the second 
St 32 so that the evaluation value P (in this case. P = 4) in formula (1) may be max-mum. and 
the selected pulse signals are output to the fourth preliminary searching circuit 34. MlartfW1 - 
SirSy the fourth preliminary searching circuit 34 executes a preliminary search.ng so as to '^lly select one 
oulse SSrom the Q x M (= 64) number of pulse signals including the signals (the pulse posrtion rn(4) = 4, 9. 4, 19. 
£ 29 ITS and the signals preliminarily selected in the third preliminary searching circurt 33 so that 
vaiuePOnScase P = 5) in formula (1) may be maximum. The selected pulse s.gnal is output to the pulse string cod- 
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* * ing circuit 20. 

The pulse string coding circuit 20 outputs the produced codes to the multiplexer 50 and the selected pulse signal 
to the gain codebook searcher 59 in the same manner as the first embodiment described above. 

The third embodiment of the pulse searcher 58 of the speech coding device shown in Fig. 1 will be described with 
5 reference to Fig. 4. 

In Fig. 4, the pulse searcher 58 includes a target signal generating circuit 10, first, second, third, fourth and fifth 
pulse generating circuits 11 to 1 5, a pulse string coding circuit 20, and first and second searching circuits 41 to 42. 

In this embodiment, as shown in Fig. 4, the third embodiment of the pulse searcher 58 has the same construction 
as the second embodiment shown in Fig. 3, except that the first and second searching circuits 41 to 42 are used instead 
10 of the first to fourth preliminary searching circuits 31 to 34. Thus, the description of the same parts as those of the sec- 
ond embodiment can be omitted for brevity. 

The target signal generating circuit 10 outputs the target signal d(n) and the auto-correlation function 4> (i, j) to the 
first and second searching circuits 41 and 42. 

The first to third pulse generating circuits 11 to 13 output the pulses to the first searching circuits 41 and the fourth 
15 and fifth pulse generating circuits 14 and 15 output the pulses to the second searching circuits 42. 

The first searching circuit 41 preliminarily selects, for example, Q (= 8) number of pulse signals from the M 3 (= 8 3 
= 512) number of pulse signals of a combination of M (= 8) number of pulse signals (the pulse position m(0) = 0, 5, 10. 
1 5, 20, 25, 30, 35) output from the first pulse generating circuit 1 1 , of M {= 8) number of pulse signals (the pulse position 
m(1) = 1, 6, 11, 16, 21, 26, 31, 36) output from the second pulse generating circuit 12, and of M (= 8) number of pulse 
20 signals (the pulse position m(2) = 2, 7, 12, 17, 22, 27, 32, 37) output from the third pulse generating circuit 13 so that 
the evaluation value P (in this case, P = 3) in formula (1) may be maximum, and the selected 8 pulse signals are output 
to the second searching circuit 42. 

The second searching circuit 42 finally selects one pulse signal from the Q x M 2 (= 8 x 8 2 = 51 2) number of pulse 
signals of a combination of M (= 8) number of pulse signals (the pulse position m(3) = 3, 8, 13,18, 23. 28. 33, 38) output 
25 from the fourth pulse generating circuit 14, of M (= 8) number of pulse signals (the pulse position m(4) = 4, 9. 14, 19, 
24, 29, 34, 39) output from the fifth pulse generating circuit 15. and of Q (= 8) number of pulse signals preliminarily 
selected in the first searching circuit 41 so that the evaluation value P (in this case, P = 5) in formula (1) may be maxi- 
mum. The selected pulse signal is output to the pulse string coding circuit 20. 

The pulse string coding circuit 20 outputs the produced codes to the multiplexer 50 and the selected pulse signal 
30 to the gain codebook searcher 59 in the same manner as the first embodiment described above. 

Further, in the third embodiment, a plurality of Viterbi searching circuits used in the first embodiment or a plurality 
of preliminary searching circuits used in the second embodiment may be used for the searching circuits to which a plu- 
rality of pulse generating circuits are connected. 

As described above, according to the present invention, in a speech coding device including a plurality of pulse 
35 searching circuits, when coding speech signals, position candidates of a plurality of pulse strings constituting the exci- 
tation signal are divided into groups, and the pulse searching circuits carry out the searching every group to determine 
the positions of the plurality of pulse strings. Hence, in the searching of the pulse strings constituting the excitation sig- 
nal, the operational amount can be reduced without deteriorating reproduction speech signal quality, resulting in repro- 
duced speech with high quality by a small operational amount. 
40 While the present invention has been described with reference to the particular illustrative embodiments, it is not to 
be restricted by those embodiments. 

It is to be appreciated that those skilled in the art can change or modify the embodiments without departing from 
the scope and spirit of the present invention. 

45 Claims 

1 . A speech coding device, in which an excitation signal of speech signals is expressed as a sum of a plurality of pulse 
strings, and positions of the pulse strings are selected from predetermined pulse position candidates to determine 
the excitation signal so that distortion between an input speech signal and a reproduced speech signal obtained by 

so exciting a synthetic filter using the excitation signal may be minimized, comprising: 

means (11 to 15) for generating a plurality of pulse strings; and 

means (21 to 24) for searching the pulse strings sequentially every pulse string using a Viterbi algorithm to 
determine the positions of the plurality of pulse strings constituting the excitation signal. 

55 

2. A speech coding device, in which an excitation signal of speech signals is expressed as a sum of a plurality of pulse 
strings, and positions of the pulse strings are selected from predetermined pulse position candidates to determine 
the excitation signal so that distortion between an input speech signal and a reproduced speech signal obtained by 
exciting a synthetic filter using the excitation signal may be minimized, comprising: 
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means (11 to 1 5) for generating a plurality of pulse strings, pulse position candidates of the pulse strings being 
expressed in a tree shape; and 

means (31 to 34) for searching the pulse strings sequentially every pulse string by a preliminary searching to 
determine the positions of the plurality of pulse strings constituting the excitation signal. 

A speech coding device, in which an excitation signal of speech signals is expressed as a sum of a plurality of pulse 
strings and positions of the pulse strings are selected from predetermined pulse position candidates to determine 
the excitation signal so that distortion between an input speech signal and a reproduced speech signal obtained by 
exciting a synthetic filter using the excitation signal may be minimized, comprising: 

means (1 1 to 1 5) for generating a plurality of pulse strings, pulse position candidates of the pulse strings being 
divided into groups; and . . ^ 

means (41 , 42) for searching the pulse strings sequentially every pulse position candidate group to determine 
the positions of the plurality of pulse strings constituting the excitation signal. 
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(54) Speech coding device 

(57) A speech coding device in which an excitation 
signal of speech signals is expressed as a sum of a plu- 
rality of pulse strings, and positions of the pulse strings 
are selected from predetermined pulse position candi- 
dates to determine the excitation signal so that distor- 
tion between an input speech signal and a reproduced 
speech signal obtained by exciting a synthetic filter 
using the excitation signal may be minimized, resulting 
in obtaining reproduced speech signals with high quality 
in a small operational amount, tn a pulse searcher (58), 
a pulse generating section (11 to 15) outputs a plurality 



of pulse strings, and a pulse searching section (21 to 
24, 31 to 34, 41, 42) sequentially searches the pulse 
strings to determine the positions of the plurality of 
pulse strings constituting the excitation signal. One 
pulse searching section (21 to 24) searches using a 
Viterbi algorithm. Another pulse searching section (31 
to 34) preliminarily searches in a tree shape of pulse 
position candidates- Another pulse searching section 
(41 , 42) searches every pulse position candidate group. 
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